Robust cartogram visualization of outliers in manifold learning

نویسندگان

  • Alessandra Tosi
  • Alfredo Vellido
چکیده

Most real data sets contain atypical observations, often referred to as outliers. Their presence may have a negative impact in data modeling using machine learning. This is particularly the case in data density estimation approaches. Manifold learning techniques provide low-dimensional data representations, often oriented towards visualization. The visualization provided by density estimation manifold learning methods can be compromised by the presence of outliers. Recently, a cartogram-based representation of model-generated distortion was presented for nonlinear dimensionality reduction. Here, we investigate the impact of outliers on this visualization when using manifold learning techniques that behave robustly in their presence.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Visualizing pay-per-view television customers churn using cartograms and flow maps

Media companies aggressively compete for their share of the pay-per-view television market. Such share can only be kept or improved by avoiding customer defection, or churn. The analysis of customers’ data should provide insight into customers’ behavior over time and help preventing churn. Data visualization can be part of this analysis. Here, a database of pay-per-view television customers is ...

متن کامل

Task Taxonomy for Cartograms

Cartograms are maps in which areas of geographic regions (countries, states) appear in proportion to some variable of interest (population, income). Despite the popularity of cartograms and the large number of cartogram variants, there are few studies evaluating the effectiveness of cartograms in conveying information. In order to design cartograms as a useful visualization tool and to be able ...

متن کامل

Task Taxonomy for Cartograms

Cartograms are maps in which areas of geographic regions (countries, states) appear in proportion to some variable of interest (population, income). Despite the popularity of cartograms and the large number of cartogram variants, there are few studies evaluating the effectiveness of cartograms in conveying information. In order to design cartograms as a useful visualization tool and to be able ...

متن کامل

The analysis of residuals variation and outliers to obtain robust response surface

In this paper, the main idea is to compute the robust regression model, derived by experimentation, in order to achieve a model with minimum effects of outliers and fixed variation among different experimental runs. Both outliers and nonequality of residual variation can affect the response surface parameter estimation. The common way to estimate the regression model coefficients is the ordinar...

متن کامل

Cartogram Visualization for Bivariate Geo-Statistical Data

We describe bivariate cartograms, a technique specifically designed to allow for the simultaneous comparison of two geo-statistical variables. Traditional cartograms are designed to show only a single statistical variable, but in practice, it is often useful to show two variables (e.g., the total sales for two competing companies) simultaneously. We illustrate bivariate cartograms using Dorling...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013